Clustering-regression-ordering steps for knowledge discovery in spatial databases
نویسندگان
چکیده
Precision agriculture is a new approach to farming in which environmental characteristics at a sub-field level are used to guide crop production decisions. Instead of applying management actions and production inputs uniformly across entire fields, they are varied to match site-specific needs. A first step in this process is to define spatial regions having similar characteristics and to build local regression models describing the relationship between field characteristics and yield. From these yield prediction models, one can then determine optimum production input levels. Discovery of “similar” regions in fields is done by applying the DBSCAN clustering algorithm on data from more than one field ignoring spatial attributes (x and y coordinates) and the corresponding yield values. Using these models, constructed on training field regions of obtained clusters, we aim to achieve better prediction on identified regions than using global prediction models. The experimental results on real life agriculture data show observable improvements in prediction accuracy, although there are many unresolved issues in applying the proposed method in practice.
منابع مشابه
Clustering and Knowledge Discovery in Spatial Databases
In the past decades, clustering has been widely used in areas such as pattern recognition, data analysis, and image processing. Recently, clustering has been recognized as a useful method for knowledge discovery in spatial databases. To eeciently detect clusters from large spatial databases with limited amount of available memory, special database techniques have been developed. In this article...
متن کاملDistributed clustering and local regression for knowledge discovery in multiple spatial databases
Many large-scale spatial data analysis problems involve an investigation of relationships in heterogeneous databases. In such situations, instead of making predictions uniformly across entire spatial data sets, in a previous study we used clustering for identifying similar spatial regions and then constructed local regression models describing the relationship between data characteristics and t...
متن کاملA Database Interface for Clustering in Large Spatial Databases
Both the number and the size of spatial databases are rapidly growing because of the large amount of data obtained from satellite images, X-ray crystallography or other scientific equipment. Therefore, automated knowledge discovery becomes more and more important in spatial databases. So far, most of the methods for knowledge discovery in databases (KDD) have been based on relational database s...
متن کاملShape-based Clustering Of Enterprise CAD Databases
Cluster analysis is a primary data mining method for knowledge discovery in spatial databases, where, the goal is to find ‘natural’ groups in a dataset based on a similarity or dissimilarity function for pairs of objects. With the number and size of spatial databases in various domains growing rapidly over the last couple of decades, methods for automated knowledge discovery in these datasets i...
متن کاملKnowledge Discovery in Spatial Databases
Both, the number and the size of spatial databases, such as geographic or medical databases, are rapidly growing because of the large amount of data obtained from satellite images, computer tomography or other scientific equipment. Knowledge discovery in databases (KDD) is the process of discovering valid, novel and potentially useful patterns from large databases. Typical tasks for knowledge d...
متن کامل